Response / Target Variable

Property_Sale_Price

0) Dwell_Type

Good Predictor

1) Zone_Class

Good Predictor

2) LotFrontage

Not good predictor

3) LotArea

Not a good Predictor

4) Road_Type

Good Predictor

5) Alley

Good Predictor

6) Property_Shape

Good Predict

7) LandContour

Good Predictor

8) Utilities

Not Good Predictor

9) LotConfig

Good Predictor

10) LandSlope

Not Good Predictor

11) Neighborhood

Good Predictor

12) Condition1

Good Predictor

13) Condition2

Good Predictor

14) Dwelling_Type

Good Predictor

15) HouseStyle

Good Predictor

16) OverallQual

Good Predictor

17) OverallCond

Good Predictor

18) YearBuilt

Not Good Predictor

19) YearRemodAdd

Not Good Predictor

20) RoofStyle

Good Predictor

21) RoofMatl

Good Predictor

22) Exterior1st

Good Predictor

23) Exterior2nd

Good Predictor

24) MasVnrType

Good Predictor

25) MasVnrArea

Not Good Predictor

26) ExterQual

Good Predictor

27) ExterCond

Good Predictor

28) Foundation

Good Predictor

29) BsmtQual

Good Predictor

30) BsmtCond

Good Predictor

31) BsmtExposure

Good Predictor

32) BsmtFinType1

Good Predictor

33) BsmtFinSF1

Not Good Predictor

34) BsmtFinType2

Good Predictor

35) BsmtFinSF2

Not a Good Predictor

36) BsmtUnfSF

Not a Good Predictor

37) TotalBsmtSF

Good Predictor

38) Heating

Good Predictor

39) HeatingQC

Good Predictor

40) CentralAir

Good Predictor

41) Electrical

Good Predictor

42) FirstFlrSF

Good Predictor

43) SecondFlrSF

Not Good Predictor

44) LowQualFinSF

Not a Good Predictor

45) GrLivArea

Good Predictor

46) BsmtFullBath

Good Predictor

47) BsmtHalfBath

Not Good Predictor

48) FullBath

Good Predictor

49) HalfBath

Good Predictor

50) BedroomAbvGr

Good Predictor

51) KitchenAbvGr

Good Predictor

52) KitchenQual

Good Predictor

53) TotRmsAbvGrd

Not Good Predictor

54) Functional

Good Pradictor

55) Fireplaces

Good Predictor

56) FireplaceQu

Good Pradictor

57) GarageType

Good Predictor

58) GarageYrBlt

Not Good Predictor

59) GarageFinish

Good Predictor

60) GarageCars

GOOD PREDICTOR

61) GarageArea

GOOD PREDICTOR

62) GarageQual

Good Predictor

63) GarageCond

Good Predictor

64) PavedDrive

Good Predictor

65) WoodDeckSF

Not Good Predictor

66) OpenPorchSF

Not Good Predictor

67) EnclosedPorch

Not Good Predictor

68) ThrSsnPorch

Not Good Predictor

69) ScreenPorch

Not Good Predictor

70) PoolArea

Not Good Predictor

71) PoolQC

Not Good Predictor

72) Fence

Good Predictor

73) MiscFeature

Good Predictor

74) MiscVal

Not Good Predictor

75) MoSold

Not Good Predictor

76) YrSold

Not Good Predictor

77) SaleType

Good Predictor

78) SaleCondition

Good Predictor

Good Predictor As per P-Value

Dewll_Type

Zone_Class

Road_Type

Alley

Proparty_Shape

LandContour

LotConfig

Neighborhood

Condition1

Condition2

Dwelling_Type

HouseStyle

OverallQual

OverallCond

RoofStyle

RoofMatl

Exterior1st

Exterior2nd

MasVnrType

ExterQual

ExterCond

Foundation

BsmtQual

BsmtCond

BsmtExposure

BsmtFinType1

BsmtFinType2

TotalBsmtSF

Heating

HeatingQC

CentralAir

Electrical

FirstFlrSF

GrLivArea

BsmtFullBath

FullBath

HalfBath

BedroomAbvGr

KitchenAbvGr

KitchenQual

Functional

Fireplaces

FireplaceQu

GarageType*

GarageFinish

GarageCars

GarageArea

GarageQual

GarageCond

PavedDrive

Fence

MiscFeature

SaleType

SaleCondition

NOT Good Predictor As per P-Value

LotFrontage

LotArea

Utilities

LandSlope

YearBuilt

YearRemodAdd

MasVnrArea

BsmtFinSF1

BsmtFinSF2

BsmtUnfSF

SecondFlrSF

LowQualFinSF

BsmtHalfBath

TotRmsAbvGrd

GarageYrBlt

WoodDeckSF

OpenPorchSF

EnclosedPorch

ThrSsnPorch

ScreenPorch

PoolArea

PoolQC

MiscVal

MoSold

YrSold

OLS Model With All Columns

OLS Model With Good Predictors

OLS Model With P < 0.05 Columns

Condition2

HouseStyle

Exterior2nd

ExterQual

BsmtExposure

BsmtFinType1

HeatingQC

GarageType

GarageFinish

SaleType

Final OLS Model

Linear Regression Assumptions (Health Check)

1 _Normality

make histogram of residuals

And check whether its approximately bell-shaped, symmetrical

2_Linearity

Make scatter plots,

x = each continuous predictor one by one

y = response variable

see whether a linear relationship is visible?

3_Independence of observation

Make scatter plot,

x = obsno

y = residuals

if, no geometric pattern is visible, assumption of INDEPENDENCE OF ERROR is holding good and not violated

4_Constant Error Variance [homoscadasticity]

Make scatter plot,

x = predict

y = residuals

if, no geometric pattern is visible, assumption of homoscadasticity is holding good and not violated

5_VIF

6_Durbin Watson

Final OLS Model (Opt)

MSE

RMSE

As Per Vinod Sir's Script

OLS Model With All Columns

OLS Model With Good predictor Columns

OLS Model With P < 0.05 Columns

Condition2

HouseStyle

Exterior2nd

ExterQual

BsmtExposure

BsmtFinType1

HeatingQC

GarageType

GarageFinish

SaleType

BsmtCond

SaleCondition

Final OLS Model

Test data in two parts

Predictions on test data

Residuals

1 _Normality

make histogram of residuals

And check whether its approximately bell-shaped, symmetrical

2_Linearity

Make scatter plots,

x = each continuous predictor one by one

y = response variable

see whether a linear relationship is visible?

3_Independence of observation

Make scatter plot,

x = obsno

y = residuals

if, no geometric pattern is visible, assumption of INDEPENDENCE OF ERROR is holding good and not violated

4_Constant Error Variance [homoscadasticity]

Make scatter plot,

x = predict

y = residuals

if, no geometric pattern is visible, assumption of homoscadasticity is holding good and not violated

5_VIF

6_Durbin Watson

RMSE: the final verdict!

Plot